PICUPP: Protein Interaction Classification by Unlikely Profile Pair

نویسندگان

  • Byung-Hoon Park
  • George Ostrouchov
  • Gong-Xin Yu
  • Al Geist
  • Andrey Gorin
  • Nagiza F. Samatova
چکیده

A computational approach that infers protein-protein interactions from genome sequences is proposed in this paper. It is based on our recent observation that protein-protein interactions can be identified by a set of “unusual” protein-profile pairs in experimentally determined protein interactions. A pair of proteinprofiles is considered to be unusual if its occurrence in the given data is statistically unusual from what is expected at random. The proposed method, called PICUPP, sifts out unusual protein-profile pairs by comparing frequency distributions of their occurrences in the given data to what may result from random appearances. It is demonstrated that such unusual protein-profile pairs with statistically assessed confidences can be learned efficiently from the DIP database using a bootstrapping approach. We particularly illustrate that unusual (or, significant) protein-profile pairs can be characterized as pair wise interactions between the Pfam domains, Blocks protein families, or InterPro signatures. Such statistically significant protein-profile pairs can be used for predicting putative pairs of interacting proteins. Their prediction accuracy is around 86% and 90% when InterPro and Pfam profiles are used, respectively at 75% confidence level.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Inference of Protein-Protein Interactions by Unlikely Profile Pair

We note that a set of statistically “unusual” proteinprofile pairs in experimentally determined database of protein-protein interactions can typify protein-protein interactions, and propose a novel method called PICUPP that sifts such protein-profile pairs using a statistical simulation. It is demonstrated that unusual Pfam and InterPro profile pairs can be extracted from the DIP database using...

متن کامل

Protein-Protein Interaction Analysis of Common Top Genes in Obsessive-Compulsive disorder (OCD) and Schizophrenia: Towards New Drug Approach

Comorbidty is common among psychiatric disorders including obsessive-compulsive disorder and schizophrenia with a high rate. Many studies suggested that the disorders may have same etiological bases. In this regard, shared pathways of glutamate, dopaminergic, and serotonin are the known ones. Here, the common significant genes are examined to understand the possible molecular origin of the diso...

متن کامل

Protein-Protein Interaction Analysis of Common Top Genes in Obsessive-Compulsive disorder (OCD) and Schizophrenia: Towards New Drug Approach

Comorbidty is common among psychiatric disorders including obsessive-compulsive disorder and schizophrenia with a high rate. Many studies suggested that the disorders may have same etiological bases. In this regard, shared pathways of glutamate, dopaminergic, and serotonin are the known ones. Here, the common significant genes are examined to understand the possible molecular origin of the diso...

متن کامل

Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks

Background: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from pro...

متن کامل

Investigating the effects of ibuprofen on the gene expression profile in Hippocampus of mice model of Alzheimer’s disease through bioinformatics analysis

Non-steroidal anti-inflammatory drugs (NSAIDs) identified effective in many diseases. One of which is neurodegenerative diseases including Alzheimer disease (AD). In this study gross alteration of gene expression in AD mice by ibuprofen treatment is investigated via Protein-protein interaction network (PPI) analysis. Expression profiling of microarray dataset GSE67306 was retrieved from GEO dat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003